

Mathematics  Research  Center 

University of Wisconsin-Madison

610  Walnut  Street 

Madison, Wisconsin 53706

August  1980 


(Received  June  24,  1980) 


Approved  for  public  release 
Distribution  unlimited 




Sponsored  by 

U.  S.  Army  Research  Office 
P. O. Box 12211
Research  Triangle  Park 
North  Carolina  27709 




UNIVERSITY  OF  WISCONSIN  -  MADISON 
MATHEMATICS  RESEARCH  CENTER 


A NOTE ON AN EXTENSION OF RATIONAL BOUNDS FOR
THE t-TAIL AREA TO ARBITRARY DEGREES OF FREEDOM

Andrew P. Soms

Technical Summary Report #2106
August 1980


ABSTRACT 

The  bounds  of  Soms  (1977)  for  the  tail  area  of  the  t-distribution  with 
integral  degrees  of  freedom  are  extended  to  arbitrary  positive  degrees  of 
freedom.  Three  numerical  examples  are  provided. 

SIGNIFICANCE AND EXPLANATION

Real  degrees  of  freedom  arise  in  the  modification  of  the  two-sample 
t-test  when  the  variances  cannot  be  assumed  to  be  equal.  For  small  degrees 
of  freedom  both  linear  interpolation  and  standard  computer  routines  may  be 
unsatisfactory. The present paper provides a simple technique for
satisfactory estimates of tail probabilities.


The responsibility for the wording and views expressed in this descriptive
summary  lies  with  MRC,  and  not  with  the  author  of  this  report. 


A  NOTE  ON  AN  EXTENSION  OF  RATIONAL  BOUNDS  FOR 
THE  t-TAIL  AREA  TO  ARBITRARY  DEGREES  OF  FREEDOM 

Andrew  P.  Soms 


1.  Introduction  and  Statement  of  Extension 

In applications, such as the Welch modification of the two-sample
t-test when variances cannot be assumed to be equal, it is
sometimes desired to evaluate the t-tail area for nonintegral
degrees of freedom. It is therefore of practical interest to
extend the results of Soms (1977) to arbitrary real degrees of
freedom. While one could use the relationship between the t and
beta distributions indicated in Section 3, this would consume
substantially more computer time because of the numerical
integration and would introduce errors if either limit of integration
were close to 0 or 1. We introduce notation and state the result
in this section, indicate the proofs in Section 2, and give some
numerical examples in Section 3.

For arbitrary real $k>0$, let

$$f_k(t) = c_k\,(1+t^2/k)^{-(k+1)/2},$$

$$c_k = \frac{\Gamma((k+1)/2)}{\Gamma(k/2)\,(\pi k)^{1/2}},$$

$$\bar F_k(x) = 1 - F_k(x) = \int_x^\infty f_k(t)\,dt,$$

and

$$R_x = \bar F_k(x) / [(1+x^2/k)\, f_k(x)].$$


Also, for $k>2$, let $\gamma_{\max} = 4c_k^2/(1-4c_k^2)$ and

$$\gamma_{\min} = \frac{k}{2(k+2)c_k^2} - 1,$$

and for $k<2$, interchange the definitions of $\gamma_{\max}$ and $\gamma_{\min}$.
Then we have the following extension of Theorems 2.1 and 2.2 of
Soms (1977).

Theorem 1.1: Let

$$p(x,\gamma) = \frac{1+\gamma}{(x^2 + 4c_k^2(1+\gamma)^2)^{1/2} + \gamma x}.$$

Then

$$p(x,\gamma_{\min}) < R_x < p(x,\gamma_{\max}), \tag{1.1}$$

or, equivalently,

$$(1+x^2/k)\, f_k(x)\, p(x,\gamma_{\min}) < \bar F_k(x) < (1+x^2/k)\, f_k(x)\, p(x,\gamma_{\max}). \tag{1.2}$$


This is a generalization of Soms (1977), where (1.1), or equivalently
(1.2), was shown to hold for integral k. It was also shown there
that if $k = 2$, $\gamma_{\max} = \gamma_{\min} = \gamma_2$ and $R_x = p(x,\gamma_2)$, and hence this
case need not be considered further.
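
As an illustration of how the bounds (1.2) might be evaluated, the following is a minimal Python sketch. It assumes the definitions of $c_k$, $\gamma_{\max}$, $\gamma_{\min}$ and $p(x,\gamma)$ given above; the function names (`c_k`, `gammas`, `p`, `tail_bounds`) are hypothetical and are not part of the report, and `math.lgamma` is used in place of whatever gamma routine the author employed.

```python
import math

def c_k(k):
    """c_k = Gamma((k+1)/2) / (Gamma(k/2) * sqrt(pi*k)), for real k > 0."""
    log_ratio = math.lgamma((k + 1) / 2) - math.lgamma(k / 2)
    return math.exp(log_ratio) / math.sqrt(math.pi * k)

def gammas(k):
    """Return (gamma_min, gamma_max) for k != 2."""
    c2 = c_k(k) ** 2
    a = 4 * c2 / (1 - 4 * c2)
    b = k / (2 * (k + 2) * c2) - 1
    # For k > 2: gamma_max = a and gamma_min = b; for k < 2 the roles interchange.
    return (b, a) if k > 2 else (a, b)

def p(x, g, k):
    """p(x, gamma) = (1+gamma) / (sqrt(x^2 + 4 c_k^2 (1+gamma)^2) + gamma*x)."""
    c2 = c_k(k) ** 2
    return (1 + g) / (math.sqrt(x * x + 4 * c2 * (1 + g) ** 2) + g * x)

def tail_bounds(k, x):
    """Lower and upper bounds (1.2) on the upper tail area beyond x > 0."""
    g_min, g_max = gammas(k)
    # (1 + x^2/k) * f_k(x) simplifies to c_k * (1 + x^2/k)^(-(k-1)/2).
    scale = c_k(k) * (1 + x * x / k) ** (-(k - 1) / 2)
    return scale * p(x, g_min, k), scale * p(x, g_max, k)

lo, hi = tail_bounds(0.2, 4.0)
print(f"{lo:.5f} < tail area < {hi:.5f}")
```

The call at the end uses $k = .2$ and $x = 4$, the first case treated in Section 3, so the printed interval can be compared with the figures reported there.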


2.  An  Outline  of  the  Proofs 

We first note that the asymptotic expansion given in Soms
(1976) for the tail area of the t-distribution is valid for
arbitrary $k>0$ even though it was stated for k a positive integer,
because this fact was not used in the derivation. Hence the
heuristic derivation of (1.1) remains exactly as before. To
ensure that the proof of Theorem 1.1 given for the integral case
works for k real, all that needs to be verified is that the $c_k$
satisfy the lemmas in the Appendix and the relationships of
Theorems 2.1 and 2.2 of Soms (1977). We state these lemmas and
relationships and indicate their proofs.

Lemma 2.1: For real $k>0$, the $c_k$ form an increasing sequence with
limit $1/(2\pi)^{1/2}$.

Proof: That the limit is $1/(2\pi)^{1/2}$ is shown by using Stirling's
formula. To obtain the result it suffices to show that

$$\frac{d \ln c_k}{dk} = g(k) > 0 \quad \text{for all } k>0.$$

By differentiation,

$$g(k) = \tfrac{1}{2}\,\psi\!\left(\tfrac{k+1}{2}\right) - \tfrac{1}{2}\,\psi\!\left(\tfrac{k}{2}\right) - \frac{1}{2k},$$

where $\psi(x)$ is the digamma function (see, e.g., Abramowitz and
Stegun, 1965, p. 258). From Artin (1964, p. 17),

$$h(k) = \tfrac{1}{2}\,\psi\!\left(\tfrac{k+1}{2}\right) - \tfrac{1}{2}\,\psi\!\left(\tfrac{k}{2}\right) = \sum_{i=0}^{\infty} \frac{1}{(k+2i)(k+2i+1)}.$$

Let $f(x) = ((k+2x)(k+2x+1))^{-1}$. Then it can be shown by
differentiation that f is convex and by considering the area under $f(x)$
from 0 to n and letting $n \to \infty$ it follows that

$$\int_0^\infty f(x)\,dx + \tfrac{1}{2} f(0) < h(k),$$

or

$$\tfrac{1}{2} \ln \frac{k+1}{k} + \tfrac{1}{2}\,\frac{1}{k(k+1)} \le h(k),$$

since $\int \frac{1}{(k+2t)(k+2t+1)}\,dt = \frac{1}{2} \ln \frac{k+2t}{k+2t+1}$. The above is a special
case of the Euler-Maclaurin summation formula and was shown to
the author by Harris (1978). Therefore it suffices to show that

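$$\ln \frac{k+1}{k} + \frac{1}{k(k+1)} > \frac{1}{k},$$

or, since $\ln \frac{k+1}{k} = \ln(k+1) - \ln k > \frac{1}{k+1}$, that

$$\frac{1}{k+1} + \frac{1}{k(k+1)} \ge \frac{1}{k},$$

which is true for all $k>0$.

Lemma 2.2: For $k>0$, $c_k^2$ [illegible].

Proof: Follows from Lemma 2.1.

Lemma 2.3: For $k>2$, $c_k^2$ [illegible].

Proof: Follows from Lemma 2.1.

Lemma 2.4: For $k>2$, $c_k^2 < k/(6k+4)$, and for $k<2$, $c_k^2 > k/(6k+4)$.
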

Proof: Let $h(k) = \ln (k/(6k+4))^{1/2} - \ln c_k$. Then $\lim_{k \to 0} h(k) = 0$,
$h(2) = 0$ and it must be shown that $h(k) < 0$, $0 < k < 2$ and $h(k) > 0$,
$k>2$. By differentiation and the recursion formula for the $\psi$
function (see, e.g., Abramowitz and Stegun, 1965, p. 258),

$$\psi(x+1) = \psi(x) + 1/x,$$

$$h'(k) = \frac{1}{k+1} - \frac{3}{2(3k+2)} - \tfrac{1}{2}\left(\psi\!\left(\tfrac{k+1}{2}+1\right) - \psi\!\left(\tfrac{k}{2}+1\right)\right) = f(k) - g(k).$$

By differentiation, it follows that for $k < (2^{1/2}-1)/3 \doteq .138$,

$$f(k) = \frac{3k+1}{2(3k+2)(k+1)}$$

is an increasing function and for $k > .138$
decreasing. Also, from Artin (1964, p. 17) it follows that
$g(k) = \frac{1}{2}\left(\psi\!\left(\frac{k+1}{2}+1\right) - \psi\!\left(\frac{k}{2}+1\right)\right)$ is both a positive and
decreasing function of k. Hence, for k such that $k_L \le k \le k_R \le .138$,

$$h'(k) < f(k_R) - g(k_R), \tag{2.1}$$

and for k such that $.138 \le k_L \le k \le k_R$,

$$h'(k) > f(k_R) - g(k_L). \tag{2.2}$$

Also, since $c_k$ and $d_k = (k/(6k+4))^{1/2}$ are both increasing functions
of k, for k such that $k_L \le k \le k_R$,

$$d_{k_L} - c_{k_R} \le d_k - c_k \le d_{k_R} - c_{k_L}. \tag{2.3}$$

Using (2.1), (2.2), and (2.3), it follows that $h'(k) < 0$, .0-.1(.05),
$h(k) < 0$, .1-.3(.01), .3-1.1(.05), and $h'(k) > 0$, 1.1-14.2(.1)
(here a-b(s) denotes the range from a to b covered in steps of s).
From Lemma 2.1, $c_k < (1/2\pi)^{1/2}$, so $h(k) > 0$ if $(k/(6k+4))^{1/2} > (1/2\pi)^{1/2}$
or $k > 14.2$. Combining the above results gives the conclusion.
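
A quick numerical spot check of Lemmas 2.1 and 2.4 on a grid can be sketched as below. This is only a sanity check at finitely many points, not the interval argument of the proof; the helper names `c_k` and `h` and the grid itself are assumptions made for this illustration.

```python
import math

def c_k(k):
    """c_k = Gamma((k+1)/2) / (Gamma(k/2) * sqrt(pi*k)), for real k > 0."""
    log_ratio = math.lgamma((k + 1) / 2) - math.lgamma(k / 2)
    return math.exp(log_ratio) / math.sqrt(math.pi * k)

def h(k):
    """h(k) = ln (k/(6k+4))^(1/2) - ln c_k, as in the proof of Lemma 2.4."""
    return 0.5 * math.log(k / (6 * k + 4)) - math.log(c_k(k))

grid = [0.05 * i for i in range(1, 401)]        # k = 0.05, 0.10, ..., 20.0
values = [c_k(k) for k in grid]

# Lemma 2.1: c_k increases with k and stays below its limit 1/sqrt(2*pi).
increasing = all(a < b for a, b in zip(values, values[1:]))
below_limit = all(v < 1 / math.sqrt(2 * math.pi) for v in values)

# Lemma 2.4: h(k) < 0 for k < 2 and h(k) > 0 for k > 2 (k = 2 itself is skipped).
negative_below_2 = all(h(k) < 0 for k in grid if k < 1.99)
positive_above_2 = all(h(k) > 0 for k in grid if k > 2.01)

print(increasing, below_limit, negative_below_2, positive_above_2)
```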

To  complete  the  extension  it  is  necessary  to  prove  the  three 
following  lemmas  used  in  Soms  (1977). 

Lemma 2.5: For $k>2$, $c_k^2 < k/(4k+8)$, and for $k<2$, $c_k^2 > k/(4k+8)$.
Proof: For $k>2$, $k/(4k+8) > k/(6k+4)$ and for $k<2$, $k/(4k+8) <
k/(6k+4)$, and so using Lemma 2.4, the result follows.

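Lemma 2.6: For $k>2$, $16kc_k^4 + (4k+4)c_k^2 > k$ and for $k<2$, $16kc_k^4 +
(4k+4)c_k^2 < k$.

Proof: By some algebra, Lemma 2.6 is equivalent to

$$c_k^2 > \left[\,2\left(\frac{k+1}{k}\right)\left(\left(1+\left(\frac{2k}{k+1}\right)^2\right)^{1/2} + 1\right)\right]^{-1} \quad \text{for } k > 2$$

and

$$c_k^2 < \left[\,2\left(\frac{k+1}{k}\right)\left(\left(1+\left(\frac{2k}{k+1}\right)^2\right)^{1/2} + 1\right)\right]^{-1} \quad \text{for } k < 2.$$

By using the recursion formula for the $\Gamma$ function this is further equivalent to

$$h_1(k) = \frac{1}{k+1}\,\frac{\Gamma^2\!\left(\frac{k+1}{2}+1\right)}{\Gamma^2\!\left(\frac{k}{2}+1\right)\pi} > h_2(k) = \frac{1}{2\left(\left(1+\left(\frac{2k}{k+1}\right)^2\right)^{1/2} + 1\right)} \quad \text{for } k > 2$$

and $h_1(k) < h_2(k)$ for $k < 2$.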

Let $g(k) = \ln h_1(k) - \ln h_2(k)$. Then $\lim_{k \to 0} g(k) = g(2) = 0$ and we
must show that $g(k) < 0$, $0 < k < 2$, and $g(k) > 0$, $k > 2$. By
differentiation,

$$g'(k) = -\frac{1}{k+1} + \left[\psi\!\left(\tfrac{k+1}{2}+1\right) - \psi\!\left(\tfrac{k}{2}+1\right)\right] + \frac{4k}{(k+1)^3\left(1+\left(\frac{2k}{k+1}\right)^2\right)^{1/2}\left(\left(1+\left(\frac{2k}{k+1}\right)^2\right)^{1/2}+1\right)}$$

$$\phantom{g'(k)} = -\frac{1}{k+1} + g_1(k) + g_2(k).$$

Now $g'(k) < -\frac{1}{k+1} + g_1(k) + 2k$ and for k such that $k_L \le k \le k_R$,
$g'(k) < -\frac{1}{k_R+1} + g_1(k_L) + 2k_R$. Also, since for $k \ge 1$, $g_2(k)$ is a
decreasing function of k, we have $g'(k) > -\frac{1}{k_L+1} + g_1(k_R) + g_2(k_R)$
for $1 \le k_L \le k \le k_R$. Hence $g'(k) < 0$, .0-.15(.01), $g(k) < 0$,
.15-.25(.001) and .25-1.60(.01), $g'(k) > 0$, 1.-3.4(.1), $g(k) > 0$,
3.4-5.0(.01) and 5.0-17(.1) and since $16kc_k^4 + (4k+4)c_k^2 >
16kc_k^4 + 4kc_k^2 = k(16c_k^4 + 4c_k^2)$ and $16c_{17}^4 + 4c_{17}^2 > 1$, $g(k) > 0$,
$k \ge 17$ (the $c_k$ being increasing by Lemma 2.1). Combining the above yields the result.
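
The "some algebra" step in the proof of Lemma 2.6 can be spelled out as follows. This is a sketch of one way to carry it out, not necessarily the author's own route; the symbols $u$ (standing for $c_k^2$), $u^*$ and $S$ are shorthand introduced here for the illustration.

```latex
\begin{align*}
&\text{Put } u = c_k^2 > 0 \text{ and } S = \Big(1+\Big(\tfrac{2k}{k+1}\Big)^2\Big)^{1/2}.
 \text{ The positive root of } 16k\,u^2 + (4k+4)\,u - k = 0 \text{ is} \\
&\qquad u^* = \frac{-(4k+4) + \big((4k+4)^2 + 64k^2\big)^{1/2}}{32k}
            = \frac{k+1}{8k}\,(S-1)
            = \frac{k+1}{8k}\cdot\frac{S^2-1}{S+1}
            = \Big[\,2\Big(\frac{k+1}{k}\Big)(S+1)\Big]^{-1}, \\
&\text{using } S^2 - 1 = \frac{4k^2}{(k+1)^2}.
 \text{ Since } 16k\,u^2 + (4k+4)\,u \text{ is increasing in } u>0, \\
&\qquad 16k\,c_k^4 + (4k+4)\,c_k^2 > k \iff c_k^2 > u^*. \\
&\text{The recursion } \Gamma(z+1) = z\,\Gamma(z) \text{ then gives} \\
&\qquad c_k^2 = \frac{\Gamma^2\!\big(\tfrac{k+1}{2}\big)}{\Gamma^2\!\big(\tfrac{k}{2}\big)\,\pi k}
             = \frac{\Gamma^2\!\big(\tfrac{k+1}{2}+1\big)}{\Gamma^2\!\big(\tfrac{k}{2}+1\big)}\cdot\frac{k}{\pi\,(k+1)^2}.
\end{align*}
```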

Lemma 2.7: For $k>2$, $c_k^2 > \frac{k(k+4)}{8(k+1)(k+2)}$ and for $k<2$, $c_k^2 < \frac{k(k+4)}{8(k+1)(k+2)}$.


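Proof: By using the recursion formula for the $\Gamma$ function, it is
equivalent to show

$$h_1(k) = \frac{\Gamma^2\!\left(\frac{k+1}{2}+1\right)}{\Gamma^2\!\left(\frac{k}{2}+1\right)} > h_2(k) = \frac{\pi\,(k+1)(k+4)}{8(k+2)} \quad \text{for } k > 2$$

and $h_1(k) < h_2(k)$ for $k < 2$. Let $g(k) = \ln h_1(k) - \ln h_2(k)$. Then
$\lim_{k \to 0} g(k) = 0$ and $g(2) = 0$. By differentiation,

$$g'(k) = \left[\psi\!\left(\tfrac{k+1}{2}+1\right) - \psi\!\left(\tfrac{k}{2}+1\right) + \frac{1}{k+2}\right] - \left(\frac{1}{k+1} + \frac{1}{k+4}\right) = g_1(k) - g_2(k).$$

Since $g_1$ and $g_2$ are decreasing functions of k and
$a(k) = k(k+4)/(8(k+1)(k+2))$ is increasing for $k < (12)^{1/2}+2 \doteq 5.46$,
we have for k such that $k_L \le k \le k_R$,

$$g_1(k_R) - g_2(k_L) \le g'(k) \le g_1(k_L) - g_2(k_R)$$

and

$$c_k^2 - a(k) \le c_{k_R}^2 - a(k_L), \quad k_R \le 5.46. \tag{2.4}$$

Using (2.4), we have $g'(k) < 0$, .0-.45(.05), $g(k) < 0$, .45-1.15(.01),
$g'(k) > 0$, 1.15-9.05(.01) and $g(k) > 0$ for $k \ge 9$ since

$$c_k^2 > \frac{k(k+4)}{8(k+1)(k+2)} \quad \text{if} \quad c_k^2 > \frac{k+4}{8(k+2)}$$

and the latter is true for $k = 9$, hence for all $k \ge 9$ because $c_k^2$
is increasing by Lemma 2.1 while $(k+4)/(8(k+2))$ is decreasing in k.
Combining the above concludes the proof.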

All the calculations involved in the proofs were done on a
computer using $\Gamma$ and $\psi$ routines.
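
The grid argument used in these proofs can be illustrated for one of the ranges quoted in the proof of Lemma 2.7. The sketch below checks, interval by interval, that the lower bound $g_1(k_R) - g_2(k_L)$ on $g'(k)$ is positive over 1.15-9.05 in steps of .01. The `digamma` function is a small stand-in written for this example (upward recurrence followed by the standard asymptotic series), not the routine the author used, and all names are hypothetical.

```python
import math

def digamma(x):
    """psi(x) for x > 0: shift x up to at least 6, then use the asymptotic series."""
    result = 0.0
    while x < 6.0:
        result -= 1.0 / x          # psi(x) = psi(x+1) - 1/x
        x += 1.0
    inv2 = 1.0 / (x * x)
    # psi(x) ~ ln x - 1/(2x) - 1/(12x^2) + 1/(120x^4) - 1/(252x^6)
    series = inv2 * (1.0 / 12 - inv2 * (1.0 / 120 - inv2 / 252))
    return result + math.log(x) - 0.5 / x - series

def g1(k):
    """g_1(k) = psi((k+1)/2 + 1) - psi(k/2 + 1) + 1/(k+2), decreasing in k."""
    return digamma((k + 1) / 2 + 1) - digamma(k / 2 + 1) + 1 / (k + 2)

def g2(k):
    """g_2(k) = 1/(k+1) + 1/(k+4), decreasing in k."""
    return 1 / (k + 1) + 1 / (k + 4)

def derivative_positive_on(start, stop, step):
    """True if g_1(k_R) - g_2(k_L) > 0 on every grid interval [k_L, k_R]."""
    n = round((stop - start) / step)
    for i in range(n):
        k_left = start + i * step
        k_right = k_left + step
        if g1(k_right) - g2(k_left) <= 0:
            return False
    return True

print(derivative_positive_on(1.15, 9.05, 0.01))
```

Because $g_1$ and $g_2$ are both decreasing, a positive value of $g_1(k_R) - g_2(k_L)$ at the endpoints of a grid interval implies $g'(k) > 0$ on that whole interval, which is what makes a finite computation sufficient.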


3.  Some  Numerical  Examples 

Suppose $k = .2$ and $x = 4$. Then using Theorem 1.1, we find that
$c_k = .19748$, $\gamma_{\max} = .16557$, $p(4,\gamma_{\max}) = .24859$, $\gamma_{\min} = .18482$,
$p(4,\gamma_{\min}) = .24857$ and $.28468 < \bar F_{.2}(4) < .28471$. The value given
by a computer routine using the relationship $F_k(t) = .5 + .5\, I_x(1/2, k/2)$,
$x = (t^2/k)/(1+t^2/k)$, is .28471. If $k = 2.5$ and $x = 2$,
then $c_{2.5} = .361809$, $\gamma_{\max} = 1.099178$, $p(2,\gamma_{\max}) = .445701$,
$\gamma_{\min} = 1.121967$, $p(2,\gamma_{\min}) = .445287$ and $.07863 < \bar F_{2.5}(2) <$ [illegible].
The computer value is .07870. Finally, if $k = .1$, $x = 25$, then
$c_{.1} =$ [illegible], $\gamma_{\max} =$ [illegible], $p(25,\gamma_{\max}) = .039997$, $\gamma_{\min} =$ [illegible],
$p(25,\gamma_{\min}) = .039997$ and $.30251 < \bar F_{.1}(25) < .30251$. The computer
value is .30251. The above statements of course reflect rounding.
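
For an independent cross-check of such figures, the tail integral can also be evaluated by direct quadrature. The sketch below is one possible approach and is not the beta-function routine referred to above: it substitutes $t = x e^{s}$, applies Simpson's rule on a finite range, and adds the leading-order remainder of the power-law tail. The names `tail_area`, `s_max` and `n` are assumptions of this example.

```python
import math

def c_k(k):
    """c_k = Gamma((k+1)/2) / (Gamma(k/2) * sqrt(pi*k)), for real k > 0."""
    log_ratio = math.lgamma((k + 1) / 2) - math.lgamma(k / 2)
    return math.exp(log_ratio) / math.sqrt(math.pi * k)

def tail_area(k, x, s_max=100.0, n=20000):
    """Upper tail area beyond x > 0 of the t-distribution with real k > 0."""
    c = c_k(k)

    def integrand(s):
        t = x * math.exp(s)
        # f_k(t) * dt/ds, where dt/ds = t
        return c * (1 + t * t / k) ** (-(k + 1) / 2) * t

    # Composite Simpson's rule on [0, s_max]; n must be even.
    step = s_max / n
    total = integrand(0.0) + integrand(s_max)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * integrand(i * step)
    area = total * step / 3

    # Beyond T = x*exp(s_max), f_k(t) is close to c * (t^2/k)^(-(k+1)/2),
    # whose integral from T to infinity is c * k^((k-1)/2) * T^(-k).
    t_cut = x * math.exp(s_max)
    return area + c * k ** ((k - 1) / 2) * t_cut ** (-k)

print(round(tail_area(0.2, 4.0), 5))
```

For $k = 2$ the tail area has the closed form $\frac{1}{2} - x/(2(2+x^2)^{1/2})$, which gives a convenient check of the quadrature itself.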

These numerical examples indicate that Theorem 1.1 would be
well suited to a small sample Monte Carlo study of the Welch t-test